Embedding a Forest in a Graph. 

Mark K. Goldberg and Malik Magdon-Ismail 
Department of Computer Science, 
Rensselaer Polytechnic Institute 
Troy, NY, 12180. 
goldberg@cs.rpi.edu; magdon@cs.rpi.edu 

Abstract 

For $p \ge 1$, we prove that every forest with $p$ trees whose sizes are $a_1, \dots, a_p$ can
be embedded in any graph containing at least $\sum_{i=1}^{p}(a_i + 1)$ vertices and having a
minimum degree at least $\sum_{i=1}^{p} a_i$.

1 Introduction. 

It is a folklore fact that every tree with $d \ge 1$ edges can be embedded in any graph with
minimum vertex degree d. Indeed, a linear algorithm to find such an embedding would 
sequentially embed the vertices of the tree according to a depth first search ordering of 
the tree vertices. It is likely, though, that the required bound on the minimum degree is 
excessive, as captured by the famous conjecture by Erdős and Sós (JU), which states that
every tree with d edges can be embedded in any graph whose average degree is at least d. 
A number of results (flTJ [51 [71 [H [6J) confirm the conjecture for some classes of trees and 
classes of graphs. The full conjecture is still neither proved, nor disproved. 

A natural extension of the problem is to embed a forest in a graph. If $F = \{T_1, \dots, T_p\}$
is a forest of $p$ trees whose sizes are $a_1, \dots, a_p$ respectively, then a necessary condition for
embedding $F$ in a graph $G$ is that $|V(G)| \ge \sum_{i=1}^{p}(1 + a_i)$. The straightforward tree
embedding algorithm outlined above may fail, even if the minimum degree is at least $\sum_{i=1}^{p} a_i$.
However, we show that this condition on the minimum degree (in addition to the obvious 
necessary condition) is sufficient to guarantee that the forest can be embedded in the graph; 
we prove the following: 

Theorem 1 Let $F = \{T_1, \dots, T_p\}$ be a forest, and $d = \sum_{i=1}^{p} a_i$, where $a_i$ is the number
of edges in the tree $T_i$ ($i \in [1, p]$). Then every graph $G$ with at least $d + p$ vertices and
minimum degree at least $d$ contains $F$ as a subgraph.

Our proof can be converted to a quadratic algorithm for embedding a forest.
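
For experimenting with Theorem 1 on small instances, the following sketch searches for a forest embedding by plain backtracking. It is not the quadratic algorithm derived from the proof; it is an exhaustive search, useful only as a sanity check. The function name `embed_forest` and the adjacency-dictionary representation (vertex mapped to its set of neighbours) are assumptions made for this illustration, and the trees are assumed to be non-empty with pairwise distinct vertex names.

```python
def embed_forest(trees, g_adj):
    """Backtracking search for an embedding of a list of trees into G.

    trees: list of adjacency dicts (hypothetical representation), with
           vertex names distinct across trees.
    g_adj: adjacency dict of the host graph G.
    Returns a dict tree-vertex -> graph-vertex, or None if none exists.
    """
    # Order the tree vertices so that every non-root vertex appears after
    # its parent; a tree vertex then has exactly one earlier neighbour.
    order = []
    for t in trees:
        root = next(iter(t))
        seen = {root}
        order.append((root, None))
        stack = [root]
        while stack:
            u = stack.pop()
            for v in t[u]:
                if v not in seen:
                    seen.add(v)
                    order.append((v, u))
                    stack.append(v)

    f, used = {}, set()

    def place(i):
        if i == len(order):
            return True
        v, parent = order[i]
        # A root may go anywhere; other vertices must be adjacent to the
        # image of their parent.
        candidates = g_adj if parent is None else g_adj[f[parent]]
        for z in candidates:
            if z not in used:
                f[v] = z
                used.add(z)
                if place(i + 1):
                    return True
                del f[v]
                used.discard(z)
        return False

    return dict(f) if place(0) else None


# A path with 2 edges plus a single edge: d = 3, p = 2.
# K_{3,3} has 6 >= d + p vertices and minimum degree 3 >= d.
k33 = {v: ({3, 4, 5} if v < 3 else {0, 1, 2}) for v in range(6)}
forest = [{"a": {"b"}, "b": {"a", "c"}, "c": {"b"}},
          {"x": {"y"}, "y": {"x"}}]
print(embed_forest(forest, k33) is not None)  # True
```

The search is exponential in the worst case, so it is only suitable for checking small examples against the statement of the theorem.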

We consider simple undirected graphs without parallel edges and loops. The set of 
vertices adjacent to a vertex x, the neighborhood of x, is denoted N(x). An embedding 
$f : H \to G$ of a graph $H$ in a graph $G$ is a one-to-one mapping $f : V(H) \to V(G)$ such
that for any two distinct vertices $x, y \in V(H)$, if $xy \in E(H)$ then $f(x)f(y) \in E(G)$. For
a graph $H$, the order of $H$ is the number of its vertices (denoted $|H|$) and the size of $H$ is
the number of its edges. For the terms not defined in this paper see (BUl). 
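
The definition of an embedding translates directly into a small checker. The sketch below assumes graphs are stored as dictionaries mapping each vertex to the set of its neighbours; the name `is_embedding` is hypothetical and chosen only for this illustration.

```python
def is_embedding(h_adj, g_adj, f):
    """Return True if f is an embedding of H in G.

    h_adj, g_adj: adjacency dicts (vertex -> set of neighbours), an
                  assumed representation.
    f: dict mapping vertices of H to vertices of G.
    """
    # f must be defined exactly on V(H) and take values in V(G).
    if set(f) != set(h_adj) or not set(f.values()) <= set(g_adj):
        return False
    # One-to-one: distinct vertices of H receive distinct images.
    if len(set(f.values())) != len(f):
        return False
    # Every edge xy of H must map to an edge f(x)f(y) of G.
    return all(f[y] in g_adj[f[x]] for x in h_adj for y in h_adj[x])


# A path a-b-c mapped into a triangle on {0, 1, 2}.
path = {"a": {"b"}, "b": {"a", "c"}, "c": {"b"}}
triangle = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}}
print(is_embedding(path, triangle, {"a": 0, "b": 1, "c": 2}))  # True
print(is_embedding(path, triangle, {"a": 0, "b": 1, "c": 0}))  # False
```

Note that non-edges of H are unconstrained: an embedding need not be induced.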

2 A Proof of Theorem 1

We prove the theorem by induction on p, the number of trees in the forest. We can assume 
that every tree in a forest has at least two vertices, so $a_i \ge 1$.

The Base Case, $p = 1$. The forest in this case consists of a single tree $T_1$ with $d$ edges.
We prove a slightly stronger statement, which implies the theorem for $p = 1$.

Lemma 1 Given a connected subgraph $C$ of $T_1$ and an embedding $f : C \to G$, there is an
embedding $g : T_1 \to G$ whose restriction to $C$ is precisely $f$.

Proof: The idea is to arbitrarily grow the embedding $f$ of $C$ to an embedding $g$ of $T_1$. If
$|C| < d + 1$, let $uv \in E(T_1)$ be an edge such that $u \in V(C)$ and $v \in V(T_1 \setminus C)$. Let
$w = f(u)$. Since $C$ has at most $d - 1$ vertices other than $u$ and since the degree of $w$ in
$G$ is at least $d$, $G$ has an edge $wz$ with vertex $z$ not in $f(C)$. Thus, $f$ can be expanded to
$g : C \cup \{v\} \to G$ by defining $g(x) = f(x)$ for all $x \in C$, and $g(v) = z$. Iterating this
expansion completes the proof. ■
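
The growth argument in this proof can be sketched as a greedy procedure. The code below is an illustration under stated assumptions, not the authors' implementation: graphs are adjacency dictionaries (vertex mapped to its set of neighbours), the name `extend_embedding` is hypothetical, and `partial` is assumed to be a valid embedding of a non-empty connected subgraph of the tree.

```python
def extend_embedding(tree_adj, g_adj, partial):
    """Greedily extend an embedding of a connected subtree to the whole tree.

    If the minimum degree of G is at least the number of edges of the
    tree, the counting argument of Lemma 1 shows the greedy choice never
    gets stuck; otherwise a ValueError may be raised.
    """
    f = dict(partial)
    used = set(f.values())
    # Embedded tree vertices whose tree neighbours may still be unplaced.
    stack = list(f)
    while stack:
        u = stack.pop()
        for v in tree_adj[u]:
            if v in f:
                continue
            # Any neighbour z of f(u) outside the current image will do.
            free = g_adj[f[u]] - used
            if not free:
                raise ValueError("stuck: minimum degree of G is too small")
            z = next(iter(free))
            f[v] = z
            used.add(z)
            stack.append(v)
    return f


# Host graph: the 3-regular cube graph Q3 on vertices 0..7.
cube = {v: {v ^ 1, v ^ 2, v ^ 4} for v in range(8)}
# A tree with d = 3 edges; pin the vertex "r" to vertex 0 of the cube.
tree = {"r": {"a", "b"}, "a": {"r", "c"}, "b": {"r"}, "c": {"a"}}
f = extend_embedding(tree, cube, {"r": 0})
print(f["r"], len(set(f.values())))  # 0 4
```

Starting from a single pinned vertex, as in the usage example, corresponds to embedding the tree with a prescribed image for one chosen vertex.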

Corollary 1 For any vertex $x$ of $T_1$ and any vertex $y$ of $G$, an embedding $f : T_1 \to G$
exists for which $f(x) = y$.

The Induction Step, $p > 1$. Assuming the theorem holds for any forest $F_{p-1}$ with $p - 1$
trees, let $F_p$ be a forest containing $p$ trees $T_1, \dots, T_p$. Denote by $a_i$ the size of $T_i$ ($i \in [1, p]$).
Let $a_1 \ge a_2 \ge \dots \ge a_p$, and let $a = a_1$.

Assumption. For the purpose of deriving a contradiction, we assume that $F_p$ cannot be
embedded in a graph $G$ satisfying the conditions of the theorem.

Lemma 2 For every embedding $g : T_1 \to G$, there is a vertex outside of $g(T_1)$ which is
adjacent to every vertex in $g(T_1)$.

Proof: If the statement were incorrect, then the removal of $g(T_1)$ from $G$ would leave a
subgraph $G'$ with at least $d + p - (a + 1) = \sum_{i=2}^{p}(1 + a_i)$ vertices. Each remaining vertex
is non-adjacent to at least one of the $a + 1$ vertices of $g(T_1)$, so it loses at most $a$ neighbors and
has degree at least $d - a \ge \sum_{i=2}^{p} a_i$ in $G'$. Inductively, $\{T_2, \dots, T_p\}$ can be embedded in $G'$,
which would yield an embedding of $F_p$ in $G$, contradicting the assumption that $F_p$ cannot be
embedded in $G$. ■

The main use of the previous lemma is to show that under our assumption, there is a large 
clique in G. 

Lemma 3 G contains a clique of size at least a + 2. 

Proof: Let $K$ be the largest clique in $G$ and suppose $|K| < a + 2$. Select any connected
subgraph $C$ of $T_1$ of order $|C| = |K|$, and embed $C$ in $K$; this is possible since $K$ is a
clique. By Lemma 1 this embedding can be expanded to an embedding $f$ of $T_1$ in $G$, and
by Lemma 2 there is a vertex outside of $f(T_1)$ adjacent to all vertices in $f(T_1)$. In particular,
it is adjacent to all vertices in $K$, contradicting $K$'s maximality. Thus, $|K| \ge a + 2$. ■

It turns out that for the rest of the proof, we only need a clique of size a. 

Lemma 4 Any tree of order a + 1 can be embedded in any connected graph of order at 
least a + 1 that contains a clique of order a. 

Proof: Start by embedding a leaf at a vertex outside an a-clique, but adjacent to a node 
in the clique (such a vertex must exist by connectivity). The remainder of the tree can be 
embedded in the clique. ■ 
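
The construction in this proof is short enough to write out. The sketch below assumes adjacency-dictionary graphs (vertex mapped to its set of neighbours), that `clique` really is a clique of the host graph, and that the host graph is connected with at least one vertex outside the clique; the name `embed_with_clique` is hypothetical.

```python
def embed_with_clique(tree_adj, g_adj, clique):
    """Embed a tree of order a + 1 using a clique of order a plus one
    outside vertex adjacent to the clique."""
    clique = set(clique)
    if len(tree_adj) != len(clique) + 1:
        raise ValueError("tree must have exactly |clique| + 1 vertices")
    # An outside vertex w adjacent to some clique vertex k; it exists when
    # G is connected and has a vertex outside the clique (otherwise next()
    # raises StopIteration).
    w, k = next((w, k) for k in clique for w in g_adj[k] if w not in clique)
    # Any tree with at least two vertices has a leaf.
    leaf = next(v for v in tree_adj if len(tree_adj[v]) == 1)
    (parent,) = tree_adj[leaf]
    # Leaf goes outside the clique, its unique neighbour to k.
    f = {leaf: w, parent: k}
    # The remaining tree vertices fill the rest of the clique in any
    # order; all edges among them are present because it is a clique.
    rest_tree = [v for v in tree_adj if v not in f]
    rest_clique = [c for c in clique if c != k]
    f.update(zip(rest_tree, rest_clique))
    return f


# K4 on {0, 1, 2, 3} with a pendant vertex 4 attached to vertex 0.
g = {0: {1, 2, 3, 4}, 1: {0, 2, 3}, 2: {0, 1, 3}, 3: {0, 1, 2}, 4: {0}}
# A path on 5 vertices (a = 4 edges).
path = {"p0": {"p1"}, "p1": {"p0", "p2"}, "p2": {"p1", "p3"},
        "p3": {"p2", "p4"}, "p4": {"p3"}}
f = embed_with_clique(path, g, {0, 1, 2, 3})
print(f["p0"], f["p1"])  # 4 0
```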

Let $K$ be a clique of size $a$ in $G$. The subgraph $G' = G \setminus K$ contains at least $d - a + p$
vertices each of degree at least $d - a$. Inductively, $F_{p-1} = \{T_2, \dots, T_p\}$ can be embedded
in $G'$. Let $g : F_{p-1} \to G'$ be such an embedding. Select any vertex $x \in K$ and a subset
$X \subseteq N(x) \setminus K$ with $|X| = d - a + 1$ vertices. This is possible since $|N(x) \setminus K| \ge d - a + 1$.

Lemma 5 Every vertex in $X$ is used by any embedding $g$ of $F_{p-1}$.

Proof: Indeed, if some vertex $x' \in X \setminus g(F_{p-1})$ is not used, then by Lemma 4 $T_1$ can be embedded in
the subgraph $H$ induced by $K \cup \{x'\}$, which would yield an embedding of $F_p$. ■

Since all $d - a + 1$ vertices of $X$ are used in the embedding $g : F_{p-1} \to G'$, exactly
$p - 2$ vertices outside of $K \cup X$, denoted $y_1, \dots, y_{p-2}$, are used by $g$. The remaining $m + 1$
vertices of the graph, outside of $K \cup g(F_{p-1})$, are denoted $s_0, s_1, \dots, s_m$. Write
$Y = \{y_1, \dots, y_{p-2}\}$ and $S = \{s_0, s_1, \dots, s_m\}$. We now split the
set of the trees of the forest $F_{p-1}$ into four subsets $\mathcal{T}_1$, $\mathcal{T}_2$, $\mathcal{T}_3$, and $\mathcal{T}_4$.

$\mathcal{T}_1$: trees which are embedded entirely in $X$;

$\mathcal{T}_2$: trees whose embedding has at least two vertices in $X$ and at least one vertex in $Y$;

$\mathcal{T}_3$: trees whose embedding has only one vertex in $X$; and

$\mathcal{T}_4$: trees whose embedding is entirely in $Y$.

Let $q_i = |\mathcal{T}_i|$ ($i = 1, 2, 3, 4$). Since every tree in $F_{p-1}$ belongs to one of these four subsets,

\[ q_1 + q_2 + q_3 + q_4 = p - 1. \]

Denote by $a(T_i)$ the size of $T_i$. For the embedding $g$: every tree in $\mathcal{T}_2$ uses at least one
vertex in $Y$; and, every tree $T$ in $\mathcal{T}_3$ (resp. $\mathcal{T}_4$) uses $a(T)$ (resp. $1 + a(T)$) vertices in $Y$.
Since there are $p - 2$ vertices in $Y$,

\[ q_2 + \sum_{T_i \in \mathcal{T}_3} a(T_i) + \sum_{T_i \in \mathcal{T}_4} (a(T_i) + 1) \le p - 2 = q_1 + q_2 + q_3 + q_4 - 1. \]

This immediately gives a lower bound for $q_1$.

Lemma 6 $q_1 \ge 1 + \sum_{T_i \in \mathcal{T}_3} (a(T_i) - 1) + \sum_{T_i \in \mathcal{T}_4} a(T_i) \ge 1 + q_4$.
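
For completeness, the bound on $q_1$ can be derived in three lines from the counting inequality over the vertices of $Y$. The derivation below is a sketch that uses only $q_1 + q_2 + q_3 + q_4 = p - 1$ and the fact that every tree has size at least 1; the calligraphic symbols $\mathcal{T}_3$, $\mathcal{T}_4$ denote the third and fourth classes of trees defined above.

```latex
\begin{align*}
q_1 &\ge 1 + \Bigl( q_2 + \sum_{T \in \mathcal{T}_3} a(T)
        + \sum_{T \in \mathcal{T}_4} \bigl(a(T) + 1\bigr) \Bigr) - q_2 - q_3 - q_4 \\
    &= 1 + \sum_{T \in \mathcal{T}_3} \bigl(a(T) - 1\bigr)
        + \sum_{T \in \mathcal{T}_4} a(T) \\
    &\ge 1 + q_4 .
\end{align*}
```

The second line absorbs $q_3$ and $q_4$ into the two sums, one unit per tree; the last line uses $a(T) - 1 \ge 0$ for trees in the third class and $a(T) \ge 1$ for trees in the fourth.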

Let $s$ be an arbitrary vertex in $S$. Our goal now is to evaluate the degree of $s$ in the subgraph
induced on $S$, based on the assumption that $F_p$ cannot be embedded. We start with

\[ |N(s) \cap S| \ge d - |N(s) \cap K| - |N(s) \cap (X \cup Y)|. \tag{1} \]

We make the following observations about the neighborhood of s in K U X U Y. 

1. $s$ is not adjacent to any vertex in $K$, else by Lemma 4 $T_1$ could be embedded in $\{s\} \cup K$.

2. $s$ is not adjacent to at least one vertex in $g(T)$ for any tree $T \in \mathcal{T}_2 \cup \mathcal{T}_3$. Indeed, if $s$ is
adjacent to every vertex in $g(T)$, a vertex of $g(T)$ which is in $X$ can be swapped with $s$; this
gives an embedding of $F_{p-1}$ that doesn't use every vertex of $X$, contradicting Lemma 5.

3. $s$ is not adjacent to at least two vertices of $g(T)$ for any tree $T \in \mathcal{T}_1$. Indeed, let $s$ be
adjacent to all but one vertex in $g(T)$, and let $y = g(x)$ be that exceptional vertex. Then
for every neighbor $x'$ (in $T$) of $x$, $s$ is adjacent to $g(x')$. By setting $g(x) = s$, we obtain
a valid embedding of $F_{p-1}$ which doesn't use the vertex $y \in X$, contradicting Lemma 5.

So, $N(s) \cap K = \emptyset$ and $|N(s) \cap (X \cup Y)| \le |X \cup Y| - (2q_1 + q_2 + q_3)$. Since $|X \cup Y| =
d - a + p - 1$, we have from Inequality (1) that the number of neighbors of $s$ in $S$ is at
least:

\begin{align*}
|N(s) \cap S| &\ge d - (d - a + p - 1) + 2q_1 + q_2 + q_3 \\
              &= a + q_1 - q_4 \\
              &\ge a + 1,
\end{align*}

where we have used $q_1 + q_2 + q_3 + q_4 = p - 1$ and Lemma 6. Thus, the degree of any
vertex $s$ in the subgraph induced by $S$ is at least $a + 1$. By Lemma 1, $T_1$ can be embedded
in this subgraph, contradicting the Assumption, and completing the proof of Theorem 1. ■

3 Conjecture



When the number of vertices equals the lower bound p + d and the minimum degree is 
at least d, then the Hajnal-Szemeredi theorem on equitable coloring EH, applied to the 
complement of the graph, guarantees the existence of $p$ disjoint cliques each of size at least $\lfloor d/p \rfloor$.
Thus, arbitrary $p$ graphs of order at most $\lfloor d/p \rfloor$ can be simultaneously embedded in the
graph. When the number of vertices increases, however, cliques are no longer guaranteed.
Our result shows that one can simultaneously embed trees, even as the number of vertices 
grows, as long as the sum of the tree sizes is at most d. 

Alternatively, one can ask whether a bound on the minimum degree is excessive to guarantee
the embeddability of a forest. Indeed, we propose a natural extension to the conjecture
by Erdős and Sós:

Let $F = \{T_1, \dots, T_p\}$ be a forest, and $d = \sum_{i=1}^{p} a_i$, where $a_i$ is the number of
edges in the tree $T_i$ ($i \in [1, p]$). Then every graph $G$ with at least $d + p$ vertices
and average degree $\ge d$ contains a subgraph isomorphic to $F$.

For a single star, the conjecture clearly holds; but, even the extension to a collection of stars 
is not clear. 